Evaluating Human and Automated Generation of Distractors for Diagnostic Multiple-Choice Cloze Questions to Assess Children's Reading Comprehension

نویسندگان

  • Yi-Ting Huang
  • Jack Mostow
چکیده

We report an experiment to evaluate DQGen’s performance in generating three types of distractors for diagnostic multiple-choice cloze (fill-in-theblank) questions to assess children’s reading comprehension processes. Ungrammatical distractors test syntax, nonsensical distractors test semantics, and locally plausible distractors test inter-sentential processing. 27 knowledgeable humans rated candidate answers as correct, plausible, nonsensical, or ungrammatical without knowing their intended type or whether they were generated by DQGen, written by other humans, or correct. Surprisingly, DQGen did significantly better than humans at generating ungrammatical distractors and slightly better than them at generating nonsensical distractors, albeit worse at generating plausible distractors. Vetting its output and writing distractors only when necessary would take half as long as writing them all, and improve their quality.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating the Relatedness of Cloze-Elide Test, Multiple-Choice Cloze Test, and C-test as Measures of Reading Comprehension

Reading comprehension ability consists of multiple cognitive processes, and cloze tests have long been claimed to measure this ability as a whole. However, since the introduction of cloze test, different varieties of it have been proposed by the testers. Thus, the present study was an attempt to examine the relatedness of Cloze-Elide test, Multiple-choice (MC) cloze test, and C-test as three di...

متن کامل

Using Automated Questions to Assess Reading Comprehension, Vocabulary, and Effects of Tutorial Interventions

We describe the automated generation and use of 69,326 comprehension cloze questions and 5,668 vocabulary matching questions in the 2001-2002 version of Project LISTEN’s Reading Tutor used by 364 students in grades 1-9 at seven schools. To validate our methods, we used students’ performance on these multiple-choice questions to predict their scores on the Woodcock Reading Mastery Test. A model ...

متن کامل

Can Automated Questions Scaffold Children's Reading Comprehension?

Can automatically generated questions scaffold reading comprehension? We automated three kinds of multiple-choice questions in children’s assisted reading: 1. Whquestions: ask a generically worded What/Where/When question. 2. Sentence prediction: ask which of three sentences belongs next. 3. Cloze: ask which of four words best fills in a blank in the next sentence. A within-subject experiment i...

متن کامل

The Relationship between Translation Tests and Reading Comprehension: A Case of Iranian University Students

The present study seeks to investigate the potentiality of the translation task as a testing method for measuring reading comprehension. To achieve this objective, two types of translation tests, open-ended and multiple-choice tests, and two types of reading comprehension tests, multiple-choice reading comprehension and open-ended cloze tests were developed in this study. The reliability of the...

متن کامل

Generating Diagnostic Multiple Choice Comprehension Cloze Questions

This paper describes and evaluates DQGen, which automatically generates multiple choice cloze questions to test a child’s comprehension while reading a given text. Unlike previous methods, it generates different types of distracters designed to diagnose different types of comprehension failure, and tests comprehension not only of an individual sentence but of the context that precedes it. We ev...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015